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Abstract. In this paper we prove the decidability of the HDOL ultimate pe- 
riodicity problem. 



1. Introduction 

1.1. The HDOL ultimate periodicity problem. In this paper we prove the 
decidability of the following problem : 

Input: Two finite alphabets, a substitution a : A* — > A* prolongable on the letter 
a, a morphism <fi : A* — > B* . 

Question: Do there exist two words u and v such that the morphic sequence 
</>((T w (a)) is equal to uv^ (i.e., is ultimately periodic)? 

We will refer to it as the HDOL ultimate periodicity problem. 

Theorem 1. The HDOL ultimate periodicity problem is decidable. 

This result was announced in |Durand preprint 201 1| . While we were ending the 



redaction of this paper, I. Mitrofanov put on Arxiv Mitrofanov preprint 2011] an- 
other solution of this problem. 
This problem was open for about 30 years. 

In 1986, positive answers were given independently for DOL systems (or purely 
substitutive sequences) in both |Harju and Linna 1986| and [Pansiot 19 86 . and, 
for automatic sequences (which are particular HDOL sequences) in [Honkala 1986 . 
Other proofs have been given for the DOL case in [Honkala 2008 and for automatic 



sequences in |Allouche7 Rampcrsad and Shallit 2009 



Recently in Durand preprint 2011 the primitive case has been solved. 
In |Honkala and Rigo 2004| is given an equivalent statement of the HDOL ulti- 
mate periodicity problem in terms of recognizable sets of integers and abstract 
numeration systems. In fact, J. Honkala already gave a positive answer to this 
question in [Honkala 1986j but in the restricted case of the usual integer bases, 
i.e., for fc-automatic sequences or constant length substitutive sequences. Recently, 
in |Bell, Charlier, Fraenkel, and Rigo 2009] , a positive answer has been given for a 
(large) class of numeration systems including for instance the Fibonacci numeration 
system. 

Let us point out that the characterization of recognizable sets of integers for ab- 



stract numeration systems in terms of substitutions given in [Maes and Rigo 2002 



(see also |Lecomte and Rigo 2010| ), together with Theorem [3 provides a decision 
procedure to test whether a recognizable set of integers in some abstract numeration 
system is a finite union of arithmetic progressions. 
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1.2. Organization of the paper. In Section [2] are the classical definitions. The 
proof of the HDOL ultimate periodicity problem is in Section[3] It could be sketched 
as follows. First we recall some primitivity arguments about matrices and substitu- 
tions. The "best or easiest situation" is when we deal with growing substitutions and 
codings (letter-to- letter morphisms) . It is known that we can always consider we are 
working with codings (see c Cobham 1968, Pansiot 1983| lAflouche an d Shallit ~2003[ 
|Cassaigne and Nicolas 2003[ IHonkala 2009] ). But from these proofs it is not clear 
that this could be algorithmically done. Thus we provide an algorithm using the 
proof of [Cassaigne and Nicolas 2003 where we replace some (non-algorithmic) ar- 



guments (Lemma 2, Lemma 3 and Lemme 4 of this paper) by algorithmic ones. 
Then we show with simple and algorithmic arguments that we can suppose we are 
working with non-erasing substitutions. 

We treat separately growing and non-growing substitutions. For growing substi- 
tutions we look at their primitive components and we use the decidability result 
established in |Durand preprint 2011 about periodicity for primitive substitutions. 



Indeed, these primitive components should generate periodic sequences. Hence, we 
check it is the case (if not, then the sequence is not ultimately periodic). From 
there, Lemma fTTI allows us to conclude. 

For the non-growing case we use a result of Pansiot P ansiot 1984) saying that we 
can either consider we are in the growing case or there are longer and longer periodic 
words with the same period in the sequence. From there, we again conclude with 
Lemma [TT] 

1.3. Questions and comments. We did not compute the complexity of the algo- 
rithm provided by our proof of the HDOL ultimate periodicity problem. Looking at 
Proposition @] and the results in [D urand preprint 201 1] we use here, our approach 
provides a high complexity. 

Our result is for one-dimensional sequences. What can be said about multidimen- 
sional sequences generated by substitution rules ? or self-similar tilings ? It seems 
hopeless to generalize our method to tilings, although the main and key result 
we use to solve the HDOL ultimate periodicity problem (that is, the main result 
in [Durand 1998] . see [Durand preprint 201 1| ) has been generalized to higher di- 
mensions by N. Priebe iiT jPriebe 2000j fsee also |Priebe and Solomyak 2"00T| ). But 
observe that in [Leroux 2005] the author gives a polynomial time algorithm to know 
whether or not a Number Decision Diagram defines a Presburger definable set (see 
also [Muchnik 2003] where it was first proven but with a much higher complex- 
ity) . From this result and |Cerny and Gruska 1986a[ ISalon 19861 ISalon 1987) it is 
decidable to know whether a multidimensional automatic sequence (or fixed point 
of a multidimensional "uniform" substitution) has certain type of periodicity (see 
[Leroux 2003} IMuchnik 2003) 1. From |Durand and Rigo preprint 2011| this type of 
periodicity is equivalent to a block complexity condition. 

2. Words, morphisms, substitutive and HDOL sequences 

In this section we recall classical definitions and notation. Observe that the notion 
of substitution we use below could be slightly different from other definitions in the 
litterature. 



2.1. Words and sequences. An alphabet A is a finite set of elements called letters. 
Its cardinality is A word over A is an element of the free monoid generated 
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by A, denoted by A*. Let (with Xi G A, < i < n - 1) be a 

word, its length is n and is denoted by |a;|. The empty word is denoted by e, |e| = 0. 
The set of non-empty words over A is denoted by A + . The elements of A N are 
called sequences. If x = x xi ■ ■ ■ is a sequence (with Xi G A, i G N) and I = [k, I] 
an interval of N we set xj = XkXk+i • • ■ x\ and we say that xi is a factor of x. If 
k = 0, we say that Xi is a prefix of x. The set of factors of length n of x is written 
C n (x) and the set of factors of x, or the language of x, is denoted by C{x). The 
occurrences in a; of a word u are the integers i such that = u. If u has 

an occurrence in x, we also say that u appears in x. When x is a word, we use the 
same terminology with similar definitions. 

The sequence x is ultimately periodic if there exist a word u and a non-empty word 
v such that x = uv" , where n u = www • • • . In this case v is called a word period and 
|w| is called a length period of x. It is periodic if u is the empty word. A word u is 
recurrent in ir if it appears in x infinitely many times. 

2.2. Morphisms and matrices. Let A and B be two alphabets. Let a be a 
morphism from A* to £?*. When u(A) = B, we say a is a coding. We say cr is 
erasing if there exists b £ A such that cr(6) is the empty word. Such a letter is 
called erasing letter. If cr(A) is included in B + , it induces by concatenation a map 
from ^4 N to B N . This map is also denoted by a . With the morphism a is naturally 
associated its incidence matrix M a = (mij)i<=B,jeA where rriij is the number of 
occurrences of i in the word cr(j). 

Let a be an endomorphism. We say it is primitive whenever its incidence matrix is 
primitive (i.e., when it has a power with positive coefficients). We denote by (cr) 
the set of words having an occurrence in some image of cr™ for some n G N. 

2.3. Substitutions and substitutive sequences. We say that an endomorphism 
a : A* — > A* is prolongable on a G A if there exists a word u G A + such that 
a(a) = au and moreover, if lim„^ +00 |cr™(a)| = +oo. Prolongable cndomorphisms 
are called substitutions. 

We say a letter b G A is growing (w.r.t. cr) if lim n ^ +00 |ct™(6)| = +oo. We say a is 
growing whenever all letters of A are growing. 

Since for all n G N, cr n (a) is a prefix of a n+1 (a) and because |tr™(a)| tends to infinity 
with n, the sequence (cr n (aaa ■ ■ ■ ))„>o converges (for the usual product topology 
on A N ) to a sequence denoted by cr w (a). The endomorphism a being continuous for 
the product topology, cr w (a) is a fixed point of cr: o^cr^a)) = cr w (a). A sequence 
obtained in this way (by iterating a prolongable substitution) is said to be purely 
substitutive (w.r.t. cr). If x G A N is purely substitutive and <j> ■ A* — > £>* is a 
morphism then the sequence y = 0(x) is said to be a morphic sequence (w.r.t. 
(cr, </>)). When </> is a coding, we say y is substitutive (w.r.t. (cr, <f))). 
The language of cr : A* — > A*, denoted by (cr), is the set of words having an 
occurrence in a n (b) for some n G N and G A. 

2.4. DOL and HDOL sequences. A DOL system is a triple G = (^4, h, u) where 
A is a finite alphabet, a : A* — > A* is an endomorphism and u is a word in A* . A 
HDOL system is a 5-tuple G = (A, B, cr, 0, u) where (A, h, u) is a DOL system, B is 
a finite alphabet and <j) '■ A* — >• £>* is a morphism. 

If the sequence (a n (uuu ■ ■ ■ ))„ (resp. (<f>(a n (uuu ■ ■ ■ )) converges in A N (resp. -B N ), 
we denote its limit by w(G). It is not difficult to prove that w(G) is a morphic 
sequence and that this can be algorithmically shown. 
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It is important to note that it is decidable to know whether w(G) exists. This is 
left as an exercise. Thus, the HDOL ultimate periodicity problem also holds for the 
sequences w(G) where G is a HDOL system. 

3. Ultimate periodicity of HDOL sequences 

In the sequel a : A* — >• A* is a substitution prolongable on a, cj> : A* — » B* is 
a morphism, y = o~ u '(a) and x — 4>{y) is a sequence of B N . We have to find an 
algorithm telling if x is ultimately periodic or not. 

3.1. Primitiveness assumption and sub-substitutions. We recall that the 
HDOL ultimate periodicity problem is already known in the primitive case. 

Theorem 2. Durand pre print 201 1] The HDOL ultimate periodicity problem is 
decidable in the context of primitive substitutions. Moreover, a word period can be 
explicitly computed. 

Proof. The first part is Theorem 26 in |Durand preprint 201 1] . The second part 
can be easily deduced from the proof of this theorem. □ 

The following lemma shows that is is decidable to check that a nonnegative matrix 
is primitive. 

Lemma 3. [Horn and Johnson 1990] The nxn nonnegative matrix M is primitive 
if and only if M n ~ 2n + 2 has positive entries. 

From Lemma [31 Section 4.4 and Section 4.5 in |Lind a nd Marc us~1995j we deduce 
following proposition. 

Proposition 4. Let M = (mij)ij<£A be a matrix with non-negative coefficients. 
There exists three positive integer p ^ 0, q, I, where q < I — 1, and a partition 
{Af, 1 < i < 1} of A such that 



M p = 
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where the matrices Mi have only positive entries or are equal to zero. Moreover, 
the partition and p can be algorithmically computed. 

Thus we have the following corollary whose proof can be given without to use this 
proposition. 

Corollary 5. One can decide whether a substitution has erasing letters and non- 
growing letters. 

The following corollary will be helpful to change the representation of x (in terms 
of (a, (j))) to a more convenient representation. 
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Corollary 6. Let r : A* — > A* be an endomorphism whose incidence matrix has 
the form of M p in Proposition^ Then for allb £ A and all j > 1 the letters having 
an occurrence in (cr'" 4 ')"' (b) or (cr\ A \y + (b) are the same. 

In what follows we keep the notation of Proposition 01 We will say that {Ai\ 1 < 
i < 1} is a primitive component partition of A (with respect to M), the A4 being 
the primitive components. If i belongs to {q + 1, • • • , /} we will say that A, is a 
principal primitive component of A (with respect to M). 

Let r : A* — > A* be a substitution whose incidence matrix has the form of M p in 
Proposition SJ Let i G {q + 1, • ■ • ,1}. We denote Ti the restriction tm 4 : ^4* — >• 
A* of t to ^4*. Because Ti(Ai) is included in A* we can consider that Tj is an 
endomorphism of A* whose incidence matrix is Mi. When it defines a substitution, 
we say it is a sub-substitution of r. Moreover the matrix Mi has positive coefficients 
which implies that the substitution Tj is primitive. 

A non-trivial primitive endomorphism always has some power that is a substitution. 
For non-primitive endomorphisms we have the following corollary. 

Corollary 7. Let r : A* — > A* be an endomorphism whose incidence matrix has 
the form of M p in Proposition [7} Then, with the notation of Proposition [^J there 
exists k < \A\\ A \ satisfying : for all i > q + 1 such that Mi is neither a null matrix 
nor the lxl matrix [1], the endomorphism T* is a (primitive) substitution for some 
letter in Ai . 

The following lemma is easy to establish. 

Lemma 8. Let x — 4>{o~ u} (a)) . If x — uv^ , where v is not the empty word, then 
each sub- substitution a' of a such that {a') C (er w (a)) verifies 4>((a')) — (v u ). 

3.2. Reduction of the problem. It may happen, as for a defined by a ab, 
b I— > a, c i-> c, that some letter of the alphabet, here the alphabet is {a,b,c}, does 
not appear in a u (a). It is preferable to avoid this situation. Corollary [6] enables us 
to avoid this algorithmically. Indeed, from PropositionHJ taking a power of a (that 
can be algorithmically found) if needed, we can suppose 

(PI) the incidence matrix of a has the form of M p in Proposition [4j 

Then, consider cr' A ' instead of a. Hence, a will continue to satisfy (PI) and, from 
Corollary, [S] we have 

(P2) for all b 6 A and all j > 1 the letters having an occurrence in a^(b) or 
cr :,+1 (6) are the same. 
Let A' be the set of letters appearing in a u {a). From (P2) it can be checked that 
o~(A') is included in A'* and that the set of letters appearing in cr(o) is A'. Thus 
a', the restriction of a to A', defines a substitution prolongable on a satisfying 
a ,u) {a) = o- u (a) such that all letters of A' have an occurrence in a' u (a) and all 
letters of A' occur in a' (a). 

When we work with morphic sequences it is much simpler to handle non-erasing 
substitutions and even better to suppose that is a coding. Such a reduction is 
possible as shown in |Cassaigne and Nicolas 2003| . 

Theorem 9. Let x be a morphic sequence. Then, x is substitutive with respect to 
a non-erasing substitution. 

This result was previously proven in [Cobham 1968] and [Pansiot 1983] (see also 
jAllouche and Shallit 2003] and [Honkala 2009]). But here as we are interested in 
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decidable issues, we need more. The proof of J. Cassaigne and F. Nicolas is short and 
inspired by |Durand 1998] . This is the second part of this proof that corresponds to 
[Durand T9 98 . It is clearly algorithmic. Whereas the first part (Lemma 2, Lemma 
3 and Lemma 4 of [Cassaign e and Nicolas 2003| ) is not because it uses the fact that 
from any sequence of integers, we can extract a subsequence that is either constant 
or strictly increasing. They use these lemmas to show the key point of their proof : 
we can always suppose that <f> and a fulfill the following : 

(3.1) \0(<?(a))\ > \<l>(a)\ > and \<f>(a(b))\ > \4>(b)\ for all b g A. 

Below we show that this can be algorithmically realized. This provides an algorithm 
for Theorem [9] 

First let us show that a can be supposed to be non-erasing. 

As a satisfies (P2), each letter e is either erasing or, for all I, a l (e) is not the empty- 
word. Let A' be the set of non-erasing letters and A" the set of erasing letters. Let 
if) be the morphism that sends the elements of A" to the empty word and that is 
the identity for the other letters. Then we define a' to be the unique endomor- 
phism defined on A' satisfying ipa = a'tp. Observe that a' is easily algorithmically 
definable and prolongable on a. Moreover we have aip = o. Let z = o~ ,LJ (a). Then, 
il>(y) = z and <j{z) = y. 

If a' is non-erasing (which can be checked using Proposition [4j then we are done. 
Otherwise, we proceed in the same way with a' . As the cardinality of the alphabet 
strictly decreases we will end, in at least \ A\ steps of the procedure, with a morphism 
ip and a non-erasing substitution r : A* — > A* prolongable on a such that x — ip(t) 
where t = r w (a). 

Note that r keeps properties (PI) and (P2) a already had. Thus we can also 
consider 

(P3) a is non-erasing. 

Consequently, from (P2), \(f> o a{a{a))\ > \4>{a{a))\ > \4>(a)\, otherwise r w (a) would 
not be an infinite sequence, and \4> o a(a(b))\ > \<fi(a(b))\ for all b £ A. Incidently, 
it implies that |</>(cr(a))| > 0. Hence changing (f> by (/) o a if needed, we can suppose 
the following inequalities holds : 

\(f>(<r(a))\ > |0(o) | > and |^(r(6))| > \<p(b)\ for all b g A. 

Hence, together with the argument of the proof of Theorem [9] we obtain the algo- 
rithm we are looking for. This is summarized in the following theorem. 

Theorem 10. There exists an algorithm that given <fi and a compute a coding 
ip and a non-erasing substitution r, prolongable on a, such that x = tp(z) where 
z = T u {a). 

Thus, in the sequel we suppose is a coding and a is a non-erasing substitution. 
We end this section with a technical lemma checking the ultimate periodicity. 

Lemma 11. Let t g A N , ip be a coding defined on A* , z = ip(t), and, u and v be 
non-empty words. Then, z = uv" iff and only if 

(1) for all recurrent words B = &1&2 • • • b%\v\ £ (f)> where the bi's are letters, 
if(B) = sbv Tb Pb where rs is a positive integer, sb is a suffix of v and ps 
a prefix of v; 
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(2) for all recurrent words BB' S (t), where B and B' are words of length 2\v\, 
PBSB' = v. 

Proof. The proof is left to the reader. □ 

3.3. The case of substitutive sequences with respect to growing substi- 
tutions. In the sequel we suppose a is a growing substitution. From Corollary [5] 
it is decidable to know whether we are in this situation. 

We recall that from the previous section we can suppose is a coding and that a 
satisfies (PI), (P2) and (P3). 

Lemma 12. Let u and v be two words. It is decidable to check whether or not (u^) 
is equal to (v u ). 

Theorem 13. The HDOL ultimate periodicity problem is decidable for substitutive 
sequences w.r.t. growing substitutions. 

Proof. Let us use the notation of Proposition 2) From Corollary [71 taking a power 
of a (less that l-Al'" 4 ') if needed, we can suppose that for all i > q + 1 such that 
Mi is neither a null matrix nor the lxl matrix [1], the endomorphism Tj defines 
a primitive sub-substitution. For such i, we denote by cr, the sub-substitution 
corresponding to Aj. 

Observe that for all i > q + 1 and b e A, the word a n (b) — <r™(6) is recurrent 
in a u (a). Thus, to check the periodicity of x, we start checking with Theorem [5] 
that the languages 4>{{o~')), where a' is a sub-substitution of a, are periodic (i.e., 
equal to some (w((r')") where w(o~') is a word period). We point out that when the 
language is periodic then w(a r ) can be computed. If for some a 1 its language is not 
periodic then x is not ultimately periodic. Then, we check that all these periodic 
languages are equal using Lemma [T2l From Lemma [51 if this checking fails, then x 
is not periodic. 

Hence we suppose it is the case : There exists a word v that is algorithmically 
given by Theorem [2] such that 4>((a')) = (v u ) for all sub-substitution a' of a. 
Consequently, we should check whether there exists u such that x — uv u . 
We note that for all n the words of length n can be algorithmically enumerated, 
using for example the substitution on the words of length n in [Qucffclcc 1987). We 
conclude using Lemma [TT] □ 

3.4. The case of substitutive sequences with respect to non-growing sub- 
stitutions. In the sequel we suppose that a is a non-growing substitution. From 
Corollary [5] it is decidable to know whether we are in this situation. We recall that 
from the previous section we can suppose is a coding and that a satisfies (PI), 
(P2) and (P3). 

Lemma 14. [Pansiot 1984, Theoreme 4.1] The substitution a satisfied exactly one 
one the following two statements. 

(1) The length of words (occurring in a" (a)) consisting of non- growing letters 
is bounded. 

(2) There exists a growing letter b G A, occurring in a" (a), such that a(b) = vbu 
(or ubv) with u G C* \ {e} where C is the set of non-growing letters. 

Moreover, in the situation (fT]) the sequence o~ u (a) can be algorithmically defined as 
a substitutive sequence w.r.t. a growing substitution. 
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Lemma 15. It is decidable to know whether a satisfies {TJ or ([2]) of Lemma \TT\ 

Proof. It can be easily algorithmically checked whether we are in the situation @ 
of Lemma Q3J Thus it is decidable to know whether we are in situation ([1]) of 
Lemma [TJ] □ 

Theorem 16. The HDOL ultimate periodicity problem is decidable for substitutive 
sequences w.r.t. non-erasing substitutions. 

Proof. From Theorem 1131 Lemma [14] and Lemma [15] it remains to consider that a 
satisfies $2$ in Lemma [Til: Let b be a letter occurring in a u such that a(b) = vbu (or 
ubv) with u £ C* \ {e} where C is the set of non-growing letters. Then (it") C (a). 
We conclude using Lemma [TO □ 

This ends the proof of Theorem [TJ 
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